Abstract

Using exhaustive Monte Carlo simulations of lattice models, we study the kinetics and mechanism of
fibril formation as a function of temperature and the number of chains. While these models are,
at best, caricatures of peptides, we show that a number of generic features thought to govern fibril 
assembly are present in the toy model. The monomer, which contains eight beads made from three 
letters (hydrophobic, polar, and charged), adopts a compact conformation in the native state. The 
kinetics of fibril assembly occurs in three distinct stages. In each stage there is a cascade of events 
that transforms the monomers and oligomers to ordered structures. In the first "burst" stage highly 
mobile oligomers of varying sizes form. The conversion to the aggregation-prone conformation occurs 
within the oligomers during the second stage. As time progresses, a dominant cluster emerges that 
contains a majority of the chains. In the final stage, particles in the aggregation-prone conformation
serve as a template onto which smaller oligomers or monomers can dock and undergo conversion to fibril
structures. The overall time for growth in the latter stages is well described by the Lifshitz-Slyozov
growth kinetics for crystallization from supersaturated solutions.

I. INTRODUCTION



The link between aggregation of proteins and a number of neurodegenerative diseases
231]. Aggregation rates depend not only on protein sequence but also on the concentration
of proteins and external conditions (temperature, pH, presence of crowding agents, etc.).
The observation that many proteins that are unrelated by sequence and structure can
aggregate and form fibrils [l^ with similar morphologies (albeit under different growth
conditions) suggests that certain generic aspects of oligomerization and subsequent fibril
growth can be gleaned from toy models. Towards this end a number of lattice models [H
have been introduced to probe the fibril formation mechanism. Here, following the important
studies by Hall and coworkers 26], we use a three-dimensional lattice model that is, in part,
inspired by all-atom simulations of oligomer formation of the peptide fragment A$\beta_{16-22}$,
to provide insights into the mechanism of fibril formation.

A soluble (S) monomeric polypeptide chain can be either random coil-like (A$\beta$ peptides or
$\alpha$-synuclein) or folded (transthyretin). Typically, fluctuations or denaturation stress can populate
one of several aggregation-prone conformations (N*). Because of conformational variations in N*,
fibrils with differing molecular structures can form starting from the same sequence. However,
the growth mechanism starting from N* to the fibril state is not fully understood. Three
mechanisms for fibril assembly have been proposed. In the nucleation-growth (NG) mechanism
[27] the first step is the oligomerization of a sufficient number of N* particles to form
a critical nucleus, which is a free-energetically uphill process. Upon forming $N^*_n$ ($n > n_c$, the
size of the critical nucleus), S monomers can rapidly add to the oligomer, resulting in growth
of oligomers and eventual fibril assembly. The templated-assembly (TA) process [2
suggests that a preformed $N^*_n$ complex, with presumably $n > n_c$, serves as a template onto which
S or N* can dock and undergo the needed structural arrangement to lock onto the template.
Based on kinetic data on prion formation in yeasts, the nucleated conformational conversion
(NCC) model has been proposed. In the NCC model it is envisioned that S forms mobile
disordered oligomers. The monomers in the oligomer undergo the S $\to$ N* conversion to form nuclei
$N^*_n$. The species $N^*_n$ can serve as a template and incorporate other (less structured) oligomers
or monomers to rapidly form ordered fibrils. The important feature of NCC is that the structural
arrangement S $\to$ N* $\to$ $N_{fib}$ ($N_{fib}$ is the monomer structure in the fibril) occurs within the
molten oligomer. In many cases the structures of N* and $N_{fib}$ are similar.

In this paper we study the mechanism of fibril assembly using a simple lattice model for 
which extensive simulations can be performed. The analysis reveals a complex scenario for 
protofilament and fibril assembly that seems to have elements of all the three growth models. 
The dependence of fibril formation time on the number of monomers reveals that late stages
of growth have a lot in common with crystallization in supersaturated solutions. These findings
arise from detailed Monte Carlo (MC) simulation studies using a toy lattice model in which
each chain has N = 8 beads of three types, namely, hydrophobic (H), polar (P), and charged
(see Methods). Our simulations show that the overall assembly of ordered protofilaments and
fibrils occurs in three distinct stages. The smallest time scale is associated with a fast "burst
phase" during which highly mobile oligomers form. During this stage there is a distribution of
oligomers of varying sizes. Because we are forced to simulate a finite number of chains we cannot
quantify the nature of the size distribution. The second stage is the transformation of the
burst-phase oligomers into a disordered but compact oligomer in which about half of the interpeptide
contacts form. It is likely that the conformational transition from S to N* takes place during
this stage, as envisioned in the NCC model. The longest time scale corresponds to the final stage
of fibril formation. In this stage the large clusters grow by incorporating the small clusters. 
The structural transitions here are best described by a dock-lock mechanism that requires the 
presence of a template. Thus, even in the toy model there are complex structural transitions 
that take place in each stage of assembly. It appears that elements of NG, TA, and NCC are 
operative depending on the stage of fibril formation. 

II. METHODS 

Model. Each chain consists of N connected beads that are confined to the vertices of a
cubic lattice. The simulations are done using M identical chains with N = 8. The sequence of a chain is
+HHPPHH-, where + and - are charged beads. The assignment of chemical character and the
nature of interactions between the beads should be viewed as a caricature of polypeptide chains
and is not a realistic representation of amino acids. Despite such drastic simplification, it has
been shown that lattice models are useful in providing insights into protein folding mechanisms.
The inter- and intra-chain potentials include excluded volume and contact (nearest neighbor)
interactions. Excluded volume is imposed by the condition that a lattice site can be occupied
by only one bead. The energy of M chains is

$$E = \sum_{m=1}^{M} \sum_{i<j}^{N} e_{s_m(i)\,s_m(j)}\,\delta(r_{ij} - a) + \sum_{m<l}^{M} \sum_{i,j}^{N} e_{s_m(i)\,s_l(j)}\,\delta(r_{ij} - a), \qquad (1)$$

where $r_{ij}$ is the distance between residues i and j, a is the lattice spacing, $s_m(i)$ indicates the type
of residue i from the m-th peptide, and $\delta(0) = 1$ and zero otherwise. The first and second terms in
Eq. (1) represent intrapeptide and interpeptide interactions, respectively.

The contact energy between H beads, $e_{HH}$, is -1 (in units of $k_BT$). The propensity of polar
(including charged) residues to be "solvated" is mimicked using $e_{P\alpha} = -0.2$, where $\alpha$ = P, +, or -.
"Salt-bridge" formation between oppositely charged beads is accounted for by a favorable contact
energy $e_{+-} = -1.4$. All other contact interactions are repulsive. The generic value for repulsion
$e_{\alpha\beta}$ is 0.2. For a pair of like-charged beads the repulsion is stronger, i.e., $e_{++} = e_{--} = 0.7$.
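The contact energies and the energy function can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, energies are stored as integers in units of 0.1 to avoid round-off, bonded neighbors are excluded from the contact sum, and periodic boundaries are ignored.

```python
SEQ = "+HHPPHH-"  # bead types along one chain, as given in the text

def contact_energy(a, b):
    """Contact energy between bead types a and b, in units of 0.1 (integers)."""
    pair = {a, b}
    if a == "H" and b == "H":
        return -10                      # hydrophobic attraction, e_HH = -1
    if pair == {"+", "-"}:
        return -14                      # salt bridge, e_+- = -1.4
    if a == b and a in "+-":
        return 7                        # like-charge repulsion, 0.7
    if "P" in pair and "H" not in pair:
        return -2                       # P with P, + or -: e_P,alpha = -0.2
    return 2                            # generic repulsion (H with P, + or -)

def total_energy(chains):
    """Energy of a list of chains; each chain is a list of (x, y, z) lattice sites.

    Assumption: non-bonded beads one lattice spacing apart count as a contact;
    no periodic images are considered in this sketch.
    """
    beads = [(m, i, site) for m, chain in enumerate(chains)
             for i, site in enumerate(chain)]
    energy = 0
    for k, (m, i, r) in enumerate(beads):
        for l, j, s in beads[k + 1:]:
            if m == l and abs(i - j) < 2:
                continue                # skip covalently bonded neighbors
            if sum(abs(p - q) for p, q in zip(r, s)) == 1:
                energy += contact_energy(SEQ[i], SEQ[j])
    return energy / 10

# Two single-chain conformations: a compact 2x2x2 cube and a planar hairpin.
cube = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0),
        (0, 1, 1), (1, 1, 1), (1, 0, 1), (0, 0, 1)]
hairpin = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0),
           (3, 1, 0), (2, 1, 0), (1, 1, 0), (0, 1, 0)]
print(total_energy([cube]), total_energy([hairpin]))
```

The compact cube collects the salt bridge, two H-H contacts, and two polar contacts, while the planar hairpin keeps only the first three.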

The chains were confined to the lattice sites of a three-dimensional cube. For example, when
M = 10 the edge length of the cube is 10a. Therefore, the volume fraction occupied by the peptides is 0.08,
which corresponds to a concentration of 250 mM. This is about three orders of magnitude denser
than that used in typical experiments.
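The volume fraction quoted above follows from counting occupied lattice sites. A minimal check, assuming each bead occupies exactly one site of a cubic box whose edge is measured in lattice spacings (the function name is hypothetical):

```python
def volume_fraction(n_chains, beads_per_chain, box_edge):
    """Fraction of lattice sites occupied by beads in a cubic box of edge box_edge (in units of a)."""
    return n_chains * beads_per_chain / box_edge ** 3

# M = 10 chains of N = 8 beads in a box of edge 10a: 80 of 1000 sites are occupied.
print(volume_fraction(10, 8, 10))
```

Converting this fraction to a molar concentration additionally requires a physical value for the lattice spacing a, which is not stated in this section.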

Simulation details. Simulations were performed by enclosing M chains in a box with periodic 
boundary conditions. We use a Monte Carlo (MC) algorithm to study the kinetics of amyloid
formation. At the beginning of each MC cycle a peptide is selected at random. Then one of
the two types of MC moves, global or local, is randomly chosen. Global and local moves are
chosen with probabilities 0.1 and 0.9, respectively. Global moves correspond to either
translation of a peptide by a in a randomly chosen direction or rotation by 90° around one of
the randomly chosen coordinate axes. The direction of rotation as well as the type of global
move are selected at random. A local move [35] corresponds to tail rotation, corner flip, or
crankshaft rotation. Given that a local move is chosen with probability 0.9, we
used the same relative probabilities for selecting the particular types of local moves as described
elsewhere [36]. We measure time in units of Monte Carlo steps (MCS). The combination of local
and global moves constitutes one MCS.
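The choice between global and local moves can be sketched as follows. The 0.1/0.9 split is taken from the text; the Metropolis acceptance rule, the equal weighting of translation and rotation, and all function names are assumptions made for illustration, since the acceptance criterion is not spelled out in this section.

```python
import math
import random

P_GLOBAL = 0.1  # probability of attempting a global move (0.9 for a local move)

def choose_move(rng):
    """Pick the type of trial move for the randomly selected peptide."""
    if rng.random() < P_GLOBAL:
        # Assumed: translation and rotation are equally likely global moves.
        return rng.choice(["translation", "rotation"])
    return "local"  # tail rotation, corner flip, or crankshaft rotation

def metropolis_accept(delta_e, temperature, rng):
    """Assumed acceptance rule: standard Metropolis criterion with k_B = 1."""
    if delta_e <= 0:
        return True
    return rng.random() < math.exp(-delta_e / temperature)

rng = random.Random(0)
moves = [choose_move(rng) for _ in range(10000)]
global_fraction = sum(m != "local" for m in moves) / len(moves)
print(round(global_fraction, 2))
```

A full implementation would also need the move generators themselves and the excluded-volume check, which are omitted here.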

Structural probes. Contacts in the aggregated state (oligomer or fibrils) are divided into 
two categories, intrapeptide and interpeptide. If two non-bonded beads (those that are not 
covalently linked) of a given chain are near-neighbors, then they form an intra-chain contact. 
An interpeptide contact in an ordered conformation is one which is (i) formed between beads 
belonging to different peptides, and (ii) the associated peptide bonds are in the ordered state. 
All interpeptide contacts in the fibril structure satisfy the condition (ii), although this is not 
generally the case for an arbitrary oligomeric structure. The numbers of intrapeptide and
interpeptide fibril contacts in an arbitrary conformation are denoted as $Q_m$ and $Q_f$, with $Q_{m,0}$
and $Q_{f,0}$ being their values in the fibril state. In what follows, quantities with the subscript 0
correspond to the fibril structure.

In order to probe the growth of the fibril we obtained the distribution of fibril clusters in a 
given oligomer conformation. A fibril cluster is computed by selecting a pair of fibril contacts 
and adding adjacent fibril contacts, whose peptide bonds are parallel or antiparallel to the bonds 
associated with original fibril contact pair. The growth of fibril cluster continues until no more 
fibril contacts can be added to the cluster in any direction. A typical oligomer contains several 
fibril clusters of different sizes that are measured by the number of incorporated fibril contacts. 
The number of fibril contacts in the largest cluster is denoted by $Q_{fc}$. In the fibril structure, a
single fibril cluster consumes all residues and all chains, and hence $Q_{fc} = Q_f$.

We have also computed the number of interpeptide contacts (of any type), $C_{out}$, which describes
the formation of the aggregated state. Aggregation of chains is also monitored by computing
the distribution of oligomers. An oligomer is defined as a group of aggregated chains.
Two oligomers are distinct if none of the chains from one oligomer interacts with any chain from
the other. A given multichain conformation may contain several oligomers, and their number,
$N_o$, is useful to characterize the process of aggregation. In addition, the number of peptides in
the largest oligomer, $N_p$, is computed. As aggregation progresses $N_p$ approaches M.
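The oligomer definition above amounts to finding connected components among chains that share at least one interpeptide contact. A minimal union-find sketch, assuming chains are lists of lattice sites, that a contact means two beads of different chains one lattice spacing apart, and ignoring periodic images (all names are hypothetical):

```python
def oligomers(chains):
    """Group chain indices into oligomers linked by interpeptide contacts."""
    owner = {site: m for m, chain in enumerate(chains) for site in chain}
    parent = list(range(len(chains)))

    def find(m):
        # Follow parent links to the root, compressing the path on the way.
        while parent[m] != m:
            parent[m] = parent[parent[m]]
            m = parent[m]
        return m

    for (x, y, z), m in owner.items():
        # Checking the three positive directions covers every neighbor pair once.
        for dx, dy, dz in ((1, 0, 0), (0, 1, 0), (0, 0, 1)):
            other = owner.get((x + dx, y + dy, z + dz))
            if other is not None and other != m:
                parent[find(m)] = find(other)

    groups = {}
    for m in range(len(chains)):
        groups.setdefault(find(m), []).append(m)
    return list(groups.values())

# Toy example with short chains: chains 0 and 1 touch, chain 2 is isolated.
example = [[(0, 0, 0), (1, 0, 0)], [(0, 1, 0), (1, 1, 0)], [(10, 10, 10), (11, 10, 10)]]
groups = oligomers(example)
largest = max(len(g) for g in groups)           # plays the role of N_p
n_free = sum(1 for g in groups if len(g) == 1)  # chains without interpeptide contacts
print(sorted(len(g) for g in groups), largest, n_free)
```

Whether isolated chains are counted in the number of oligomers is a convention; here they are reported separately as free chains.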

Kinetics of assembly: To follow the kinetics of aggregation an initial distribution of M random
peptide structures is generated and equilibrated at high temperature (T = 3.0) for 10^ MCS.
The resulting distribution of chains is used as a starting point for initiating fibril assembly, which
begins by quenching the temperature to Tg (< 3.0). Each MC trajectory starts with a unique
distribution of chains. The total number $N_{mc}$ of MC trajectories for a given M varies from
100 to 400. The first instance when the fractions of intrapeptide and interpeptide fibril contacts
exceed 0.85 is associated with the first passage time $\tau_{fib,i}$ for fibril assembly for trajectory
i. The threshold $\gamma$ (= 0.85), which is a fraction of intra- and inter-chain fibril contacts, was
chosen empirically by analyzing numerous MC trajectories. The mean time of fibril assembly is
computed by fitting the yield of the fibril structure $P_f(t)$ in the pool of $N_{mc}$ independent MC
trajectories.
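The first passage time and the yield of fibril structure can be sketched as below. This assumes that both contact fractions must exceed the 0.85 threshold at the same time, which is one reading of the criterion above; the function names and the toy data are hypothetical.

```python
def first_passage_time(times, intra_fraction, inter_fraction, threshold=0.85):
    """Return the first time at which both contact fractions exceed the threshold, else None."""
    for t, qm, qf in zip(times, intra_fraction, inter_fraction):
        if qm > threshold and qf > threshold:
            return t
    return None

def fibril_yield(passage_times, t):
    """Fraction of trajectories that have reached the fibril state by time t."""
    reached = sum(1 for tau in passage_times if tau is not None and tau <= t)
    return reached / len(passage_times)

# Toy trajectory sampled at four times (arbitrary units).
times = [0, 10, 20, 30]
tau = first_passage_time(times, [0.1, 0.9, 0.9, 0.95], [0.0, 0.5, 0.86, 0.9])
print(tau, fibril_yield([tau, None, 25, 40], 30))
```

The mean assembly time would then be obtained by fitting the yield curve over all trajectories; the fitting form is not specified in this section and is left out of the sketch.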

Rapid nucleation of fibril structure was analyzed as follows. For each trajectory we considered
an interval of 10^ MCS immediately preceding $\tau_{fib,i}$ and computed various quantities associated
with fibril formation as described above. In addition, within the time interval $\tau_{fib,i}$ - 10^ < t <
$\tau_{fib,i}$ we considered the subset of fibril contacts in the largest fibril cluster $Q_{fc}$ which satisfy
two conditions [37, 38], namely, (i) that these fibril contacts are formed at the time of fibril
assembly $\tau_{fib,i}$ and (ii) that apart from short-lived disruptions they remain stable within the
interval (t, $\tau_{fib,i}$). The disruptions of fibril contacts must not exceed a duration of 2000 MCS. The results
do not depend on this specific value when it is varied by ±1000 MCS. The fibril contacts
satisfying these two conditions are referred to as "nucleation" fibril contacts and their number
is denoted as $Q_{nfc}$.
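The two conditions for a "nucleation" fibril contact can be expressed as a simple filter on a time series. The sketch below assumes that each contact is recorded as a list of booleans sampled at a fixed interval, that the list starts when the contact first appears and ends at the time of fibril assembly, and that a disruption is measured as the number of consecutive absent samples times the sampling interval. The function name and the sampling scheme are hypothetical.

```python
def is_nucleation_contact(present, dt, max_gap=2000):
    """Check conditions (i) and (ii) for one fibril contact.

    present: booleans sampled every dt MCS, ending at the time of fibril assembly.
    max_gap: longest tolerated disruption in MCS (2000 in the text).
    """
    if not present or not present[-1]:
        return False                  # (i) contact must exist at fibril assembly
    gap = 0
    for p in present:
        gap = 0 if p else gap + 1     # length of the current disruption, in samples
        if gap * dt > max_gap:
            return False              # (ii) disruption lasted too long
    return True

print(is_nucleation_contact([True, False, False, True], dt=1000))
print(is_nucleation_contact([True, False, False, False, True], dt=1000))
```

Counting the contacts in the largest fibril cluster that pass this filter gives the number of nucleation contacts for a trajectory.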



III. RESULTS AND DISCUSSION 



Monomeric and fibril structures 

Monomer. Exact enumeration of all possible conformations of the monomer of 8 beads shows
that there are 18 energy levels. The three lowest levels in the spectrum are presented in Fig. 1. The
monomeric native state is compact, and it has the lowest energy E = -3.8. It should be noted
that the conformation of the chain in the fibril state is not compact and it belongs to the first
excited state (label N* in Fig. 1), which is four-fold degenerate. Fluctuations in the monomer
conformations have to populate the structure with E = -3.4 for oligomerization to start. Such
fluctuations, under conditions when the native structure is stable, can occur spontaneously or
through inter-chain interactions. Clearly, suppression of fluctuations at low temperatures would
slow down the process of oligomerization. The toy model captures the well-accepted proposition
that aggregation requires partial unfolding of the native conformation [39].
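The exact enumeration of the monomer can be reproduced with a short depth-first search over self-avoiding walks. This is a sketch rather than the authors' code: it assumes the contact energies given in Methods, stores energies as integers in units of 0.1, fixes the first bond to remove overall rotations, and does not reduce the remaining symmetry-related copies, so the conformation counts per level are not the degeneracies quoted in the text.

```python
SEQ = "+HHPPHH-"
STEPS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]

def contact_energy(a, b):
    """Contact energy between bead types, in units of 0.1 (values from Methods)."""
    pair = {a, b}
    if a == "H" and b == "H":
        return -10
    if pair == {"+", "-"}:
        return -14
    if a == b and a in "+-":
        return 7
    if "P" in pair and "H" not in pair:
        return -2
    return 2

def energy(walk):
    """Sum contact energies over non-bonded nearest-neighbor pairs of one chain."""
    total = 0
    for i in range(len(walk)):
        # On a cubic lattice contacts are only possible for odd |i - j| >= 3.
        for j in range(i + 3, len(walk), 2):
            if sum(abs(p - q) for p, q in zip(walk[i], walk[j])) == 1:
                total += contact_energy(SEQ[i], SEQ[j])
    return total

def enumerate_levels():
    """Map each energy (units of 0.1) to the number of walks found at that energy."""
    levels = {}

    def grow(walk, occupied):
        if len(walk) == len(SEQ):
            e = energy(walk)
            levels[e] = levels.get(e, 0) + 1
            return
        x, y, z = walk[-1]
        for dx, dy, dz in STEPS:
            nxt = (x + dx, y + dy, z + dz)
            if nxt not in occupied:          # excluded volume
                occupied.add(nxt)
                walk.append(nxt)
                grow(walk, occupied)
                walk.pop()
                occupied.remove(nxt)

    grow([(0, 0, 0), (1, 0, 0)], {(0, 0, 0), (1, 0, 0)})
    return levels

levels = enumerate_levels()
lowest = sorted(levels)[:3]
print([e / 10 for e in lowest], len(levels))
```

The printed number of distinct levels can be compared with the 18 levels reported above, and the two lowest energies with E = -3.8 and E = -3.4.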

Ensemble of peptides. When multiple chains are present in the unit cell, aggregation is
readily observed, and in due course they form ordered structures (Fig. 2). Exact enumeration
of all conformations for multi-chain systems is not possible, so the structure of the lowest
energy has to be determined using simulations. We used the MC annealing protocol, which
allows for an exhaustive conformational search, to find the lowest energy conformation. In the
ordered protofilament (M = 10) and fibril (M = 16) structures the chains adopt an antiparallel
arrangement (Fig. 2).

The nature of ordering changes depending on M, and hence the concentration. For M ≤ 10
the chains are arranged in a single layer while for M > 10 the fibril state has a double-layer
arrangement (Fig. 2). Just as noted using all-atom molecular dynamics simulations [40], the
organization of chains in the fibril satisfies the principle of amyloid self-assembly (PASA),
which states that fibril structures are determined by maximizing the number of salt bridges and
hydrophobic contacts [40]. In accord with PASA, we found that the organization of the lowest
energy structure demonstrates a remarkable order leading to the maximization of favorable
electrostatic and hydrophobic interactions (Fig. 2). All H beads (in green), located in the core
of the fibril, are sandwiched between exposed layers of P beads (in yellow) and charged beads (in blue
and red). More importantly, all peptides adopt an in-registry antiparallel mutual orientation, which
implies that for all bond vectors connecting nearest neighbor pairs of residues (i, N - i + 1) and
(i + 1, N - i), $\mathbf{r}^{m}_{i,i+1} = -\mathbf{r}^{l}_{N-i,N-i+1}$, where m and l are the peptide indices. The antiparallel
arrangement is enforced by favorable electrostatic interactions. Fig. 2 shows that the nearest
neighbors of all negatively charged terminals (in red) are positively charged beads (in blue).

For M = 10, in all there are 84 interpeptide fibril contacts and 30 intrapeptide contacts, and
the entire protofilament structure in Fig. 2 comprises a single layer. This implies that a given
interpeptide antiparallel in-registry arrangement of chains is translated across the entire volume
of the fibril in all directions. It is interesting that all intrapeptide contacts are also found in
the native conformation of the monomer (lowest energy conformation in Fig. 1) and the N*
structure (Fig. 1). Due to different possible distributions of peptides within the volume of a
fibril, the lowest energy fibril structure has non-zero entropy.

The fibril contains both interpeptide and intrapeptide interactions. The structure of the
monomer in the ordered fibril coincides with one of the structures that is higher in energy
than the native monomer conformation (conformation N* in Fig. 1). Because the fibrils are
associated with aggregation of unfolded structures (here the first excited state in the spectrum
of allowed monomer conformations), it is logical that other morphologies that nucleate from
different unfolded conformations can form. By scanning the sequences for N = 8 we could
not produce fibrils starting from high energy monomer conformations, which highlights one of
the limitations of the lattice model. This observation suggests that as long as the peptide sequence
contains hydrophobic patches and oppositely charged residues distributed along the sequence, the
fibril structure is likely to include a mixture of inter- and intrapeptide interactions. The combination
of inter- and intrapeptide contacts maximizes the number of hydrophobic contacts and salt bridges, thus
satisfying the PASA.

There are superficial similarities between the structures in Fig. 2 and the model proposed for
A$\beta_{1-40}$, whose sequence is interspersed with charged and hydrophobic residues. The amyloidogenic
A$\beta$ peptide contains two hydrophobic regions (the central hydrophobic cluster and the C-terminal)
as well as charged residues. The proposed fibril model for A$\beta_{1-40}$ is based on the assumption that
an A$\beta_{1-40}$ monomer contains a turn, which brings two hydrophobic regions into proximity and
facilitates formation of a salt bridge [41].

Time scales for monomer folding and fibril assembly 

The short chain (N = 8) allows us to compute the times $\tau_F$ for monomer folding as a function
of temperature. The decay of the population of unfolded conformations is best described using
a single exponential (data not shown), which is characteristic of a well-designed sequence. The
folding time $\tau_F$ is well below 10^ MCS (Fig. 3) over a wide temperature range. In contrast, the
temperature-dependent time for fibril formation, $\tau_{fib}$, is dramatically different (Fig. 3). There
are two striking observations about $\tau_{fib}$. First, $\tau_{fib}$ is about 4-6 orders of magnitude larger than
$\tau_F$. Clearly, the sizes of the monomer and the fibril can cause the vastly greater value of $\tau_{fib}$
compared to $\tau_F$. The effect of system size can be roughly rationalized using the approximate
dependence of $\tau_F$ on N [42]. It has been shown that $\tau_F \approx \tau_{F0} e^{\sqrt{N}}$ [3]. Assuming that $\tau_{F0}$ does
not change significantly and taking into account that the fibril in our model is 10 times larger
than the monomer, size consideration alone would yield $\tau_{fib}/\tau_F \approx$ 10^. In addition, formation
of fibrils (or protofibrils) also requires collective fluctuations (formation of a nucleus, for example),
which requires that several monomers access the N* structure in the first excited state of the
isolated monomer (Fig. 1). There are barriers associated with such processes that also increase
$\tau_{fib}$. The relative values $\tau_{fib} \approx (10^4 - 10^6)\tau_F$ are not inconsistent with experimental observations.
Typical values of $\tau_F$ for small proteins are about 1-100 ms. Thus, our simulations would
suggest $\tau_{fib} \approx (10^2 - 10^4)$ s assuming $\tau_F \approx$ 10 ms.
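The conversion from simulation time scales to seconds in the last step is plain arithmetic. A minimal sketch, assuming the separation of 4 to 6 orders of magnitude between fibril formation and folding quoted above and a folding time of 10 ms (the function name is hypothetical):

```python
def fibril_time_seconds(ratio, folding_time_seconds=10e-3):
    """Fibril formation time in seconds for a given ratio of fibril time to folding time."""
    return ratio * folding_time_seconds

# Ratios of 10^4 and 10^6 bracket the range of 4-6 orders of magnitude.
low = fibril_time_seconds(1e4)
high = fibril_time_seconds(1e6)
print(low, high)
```

With a folding time of 10 ms, the two bounds come out at roughly 100 s and 10,000 s.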

The most striking aspect of Fig. 3 is the dramatic difference in the T-dependence of $\tau_F(T)$
and $\tau_{fib}(T)$. The temperature independence of $\tau_F(T)$ in the range $0.3T_F < T < 1.3T_F$ is typical of
well-designed monomer sequences for which $T_F \approx T_\theta$, where $T_\theta$ is the collapse transition temperature.
In contrast, $\tau_{fib}(T)$ changes drastically as T varies. In the narrow temperature range
($T_F < T < 1.4T_F$) $\tau_{fib}$ varies by almost two orders of magnitude. At the temperature $T \approx 1.3T_F$
(Fig. 3), when $\tau_{fib}$ is the smallest, the native structure is less stable than the unfolded ensemble.
The structures of the partially unfolded conformations at $T \approx 1.3T_F$ show that the probability
of the "salt bridges" (intramolecular contacts between + and - beads) being formed exceeds
0.5. At $T \approx 1.3T_F$ there is substantial probability of populating the aggregation-prone monomer
(Fig. 1b) that acts as a seed for nucleation and growth. At $T \approx 1.3T_F$ the fibrils form in the
smallest time with 100% yield, whereas at $T = T_F$ the yield of the fibril drops to 0.42 during
simulations lasting 10^ MCS.

The observation that partial unfolding of the native state is a necessary condition for oligomerization
and fibril growth is consistent with experimental observations that many non-homologous
protein sequences assemble into amyloid fibrils under denaturing conditions [45]. Although the
formation of fibrils is apparently a generic feature of polypeptide sequences, our simulations
suggest that for a given sequence there may be only a narrow window of external conditions
that favor rapid fibril assembly. Besides requiring that the native monomer partially unfolds
for aggregation to begin, the denaturing conditions must also be relatively mild. Under these
conditions aggregation-prone structures with intramolecular native interactions that are moderately
stable can be populated. In our model the conformation that nucleates and grows (Fig. 1) is
homogeneous, which results in a unique fibril structure. Denaturing conditions that favor its
formation, with intact "salt bridges", result in the most rapid assembly (Fig. 1). In polypeptide
chains there may be a collection of conformations that can lead to fibrils. The differences in fibril
morphology are probably linked to the variations in the initial conformations of the monomer.

Fibril assembly occurs in three major stages

Formation of protofilaments: To provide microscopic details of fibril assembly we generated
multiple MC trajectories for M = 10 at Tg = 0.65 = $1.3T_F$, at which $\tau_{fib}$ is the smallest (Fig. 3).
In all, 100 MC trajectories starting from random initial conditions were generated. The
length of MC trajectories (8x10'' MCS) at Tg was sufficiently long to observe ordered structure
formation in each trajectory. Fig. 4 displays several quantities averaged over 100 trajectories
and normalized to vary from 1 (at t = 0) to 0 (the equilibrium value). The averaging over the
ensemble of trajectories is indicated by angular brackets < >. The timescales from exponential
fits to these functions describe the kinetics of fibril formation. Analysis of the various
time-dependent quantities and inspection of the structures sampled en route to the final fibril give
an intuitive picture of assembly and growth.

Immediately after the temperature quench to Tg, the chains are randomly distributed in the
unit cell. The numbers of intra- and interpeptide fibril contacts are negligible, and there are
relatively few interchain interactions. The largest oligomer contains, on average, four chains
($N_p = 4$). Within a short time the inter-chain interactions trigger the formation of oligomers,
which represent the growth stage in the route to fibrils. Fig. 4 shows that the average number
of free chains $\langle N_{free} \rangle$ (those which do not make interpeptide contacts) is less than one in
≈ 0.03 x 10^ MCS or $0.01\tau_{fib}$. Almost concurrently, the number of peptides in the largest
oligomer $\langle N_p(t) \rangle$ exceeds nine. Thus, already in the initial stage the chains interact and
cooperatively form fluid-like oligomers. Indeed, $\langle N_p(t) \rangle$ grows on the time scale of 0.06 x 10^
MCS or $0.02\tau_{fib}$, and approaches its equilibrium value of 9.8. Therefore, virtually all chains are
incorporated in a single burst phase leading to mobile oligomer formation.

The second stage in fibril assembly is associated with the formation of intra- and interpeptide
interactions, which transforms the mobile oligomer formed in the first stage into a compact
disordered oligomer. During this stage structural rearrangement and conversion from S to N* take
place, as shown by a number of quantities. The intrapeptide fibril contacts $\langle Q_m(t) \rangle$ (data
in blue in Fig. 4) are formed on the timescale of $0.1\tau_{fib}$. On a similar time scale, the number of
interpeptide contacts $\langle C_{out}(t) \rangle$ (data in green) approaches the equilibrium value of approximately
67. Interestingly, the number of distinct clusters $\langle N_{fc}(t) \rangle$ reaches a maximum during
this stage of fibril assembly (data not shown). We surmise that the disordered oligomer contains
as many as four distinct fibril clusters, the largest of which already comprises roughly 50% of
the entire protofilament. Fig. 4 further demonstrates that at this stage the distribution of the
volume of fibril clusters extends from predominantly small clusters ($Q_{fc} \le 14$) to larger ones
($15 \le Q_{fc} \le 28$). The total number of fibril contacts is still relatively small in the disordered
oligomer ($\langle Q_f \rangle \approx 30 = 0.36\,Q_{f,0}$ at this stage). Therefore, disordered oligomers are characterized
by a nascent single-layer protofilament-like structure (Fig. 2), which emerges in the oligomer
volume as a distribution of disjoint fibril clusters of varying sizes.

The transformation of disordered oligomers to an ordered structure occurs during the third
stage of fibril assembly. It follows from Fig. 4 that the timescale for the formation of interpeptide
fibril contacts $\langle Q_f(t) \rangle$ is 0.5 x 10^ MCS or $\approx 0.2\tau_{fib}$ (data in red). Importantly, on
the same time scale the dominant fibril cluster grows, as shown by $\langle Q_{fc}(t) \rangle$ (data in orange).
This result indicates that the formation of fibril structure occurs via the growth of the largest
fibril cluster at the expense of small clusters. The winner-take-all scenario of fibril growth is
further described below. The number of fibril clusters $\langle N_{fc}(t) \rangle$ decreases to less than 3 in the
time interval $0.2\tau_{fib} < t < \tau_{fib}$. On the other hand, the maximum in the kinetic distribution
of the fibril structure among the clusters shifts to the right, signaling the emergence of large
clusters ($43 \le Q_{fc} \le 70$). By assigning weight in proportion to the size of fibril clusters we find
that the dominant fibril cluster comprises almost the entire fibril structure. In accord with this
conclusion we computed the fraction of fibril contacts (i.e., the fraction of fibril structure) in
the largest clusters, $43 \le Q_{fc} \le 70$ (results not shown). It is clear that at $t > 0.4\tau_{fib}$ more
than 80% of ordered structure is localized in a single large fibril cluster. Because on these time
scales $\langle N_{fc}(t) \rangle \approx 2$, the remaining 10 to 20% of fibril contacts are found in a much smaller
satellite fibril cluster.

The formation of a dominant cluster containing the protofilament also follows from the
calculations of thermodynamic quantities. The thermal averages of the number of fibril contacts
$\langle Q_f \rangle$ and the number of fibril contacts in the largest fibril cluster $\langle Q_{fc} \rangle$ are 52 and
47, respectively. Thus, $\langle Q_{fc} \rangle = 0.90 \langle Q_f \rangle$. After the dominant fibril cluster appears at
$t \approx 0.4\tau_{fib}$, its further growth and consolidation continue until it reaches its equilibrium size
(about 60% of all fibril contacts). This kinetic phase can be described by an additional time scale
with small amplitude. Due to this additional fibril ordering the final fibril assembly takes place
only at $\tau_{fib}$ = 3.3 x 10^ MCS. Thus, long after the formation of the largest cluster, structural
reorganization continues until the ordered stable fibril forms. The slow templated assembly
within the large cluster is reminiscent of the lock phase.

Mechanism of fibril assembly: In order to probe the mechanism of fibril formation (two-layer
structure in Fig. 2b), at Tg = 0.7, we generated 100 trajectories, each being 10^ MCS long.
The mean time for fibril formation is $\tau_{fib} \approx$ 2 x 10'' MCS. These long runs ensure that the
fully ordered state is reached in each trajectory. Qualitatively, the fibril formation kinetics
is the same as in the M = 10 case, i.e., it follows three-stage kinetics. However, there are
a few quantitative differences. In the protofilament formation case the interpeptide contacts
$\langle C_{out}(t) \rangle$ and intrapeptide fibril contacts $\langle Q_m(t) \rangle$ (Fig. Hb) are formed on the same time
scale. For M = 16 (Fig. Hb) $\langle C_{out}(t) \rangle$ approaches the value of 0.5 earlier. A fit of $\langle C_{out}(t) \rangle$
using a sum of three exponential functions gives $\tau_1$ = 0.15 x 10^ MCS $\approx 0.01\tau_{fib}$, $\tau_2 \approx$ 10^ MCS
$\approx 0.05\tau_{fib}$, and $\tau_3 \approx$ 11.2 x 10^ MCS $\approx 0.5\tau_{fib}$. Thus, $\tau_1$ is a characteristic time scale of the
"burst phase" in which fluid-like clusters form. On this time scale only a few interpeptide fibril
contacts ($Q_f \approx$ 0.6% of total contacts) are formed and the largest oligomer contains, on
average, only five peptides ($N_p = 5$). Using the three-exponential fit and data presented in Fig.
5a one can show that the formation of the largest cluster occurs on a time scale of $\approx 0.02\tau_{fib}$. The
number of peptides in this cluster approaches 15 (Fig. (Hi) whereas the number of free peptides
becomes zero. Almost simultaneously the number of distinct fibril clusters $\langle N_{fc}(t) \rangle$ reaches
a maximum (data not shown).

The second stage of fibril assembly, in which the burst phase oligomer is transformed into
a compact disordered oligomer, takes place on the time scale $\tau_2 \approx 0.05\tau_{fib}$. Due to the larger
value of M this time is larger than for M = 10. At this stage 50% of the equilibrium values of
the intra- ($Q_m$) and interpeptide ($Q_f$) fibril contacts are formed. Contrary to the M = 10 case,
fibril contacts in the largest cluster $Q_{fc}$ are formed earlier than the total $Q_f$. This is probably due
to the increasing role of the satellite clusters as the number of monomers increases. On long time
scales we have more than two and less than two such clusters for M = 16 and 10, respectively.
The "winner-take-all" scenario is also valid for the M = 16 system because for $t > 0.2\tau_{fib}$ the
largest cluster contains $\approx$ 75% of fibril contacts. These observations are made quantitative using
the dependence of $\tau_{fib}$ on M (see below).

As seen from Fig. 5a, the three exponentials ($f(t) = f_0 - f_1 \exp(-t/\tau_1) - f_2 \exp(-t/\tau_2) -
f_3 \exp(-t/\tau_3)$) fit the data well (dashed line). Here, we have three different time scales,
$\tau_1 \approx$ 0.17 x 10^, $\tau_2 \approx$ 1.24 x 10^ and $\tau_3 \approx$ 12.18 x 10^ MCS (the weights of these phases
are $f_1 \approx 0.19$, $f_2 \approx 0.46$ and $f_3 \approx 0.1$). Experiments [28] on the fibril growth kinetics of
Aβ-peptides, which were fit using a sum of two exponential functions, have been interpreted in
terms of a template-assisted "dock-lock" mechanism. From the perspective of the present study we
conclude that such a mechanism is probably valid during the second and third stages of fibril
growth. The lock phase, during which in-registry arrangement of the chains takes place, clearly
occurs only during the last part of stage three of the fibril growth process. The early stages of
growth reveal a much more complex set of events in which the physical processes described in the
NG and NCC models are manifested (see also the Concluding Remarks).
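
To make the fitting form concrete, the following minimal Python sketch evaluates a three-exponential function of the kind used above. The amplitudes are the fitted values quoted in the text. The plateau `F0` is an assumption, chosen so that f(0) = 0, and the time constants are only the quoted mantissas, treated as sharing one arbitrary time unit (also an assumption). The function names are illustrative and not taken from the original work.

```python
import math

# Amplitudes quoted in the text for the M = 16 fit.
F1, F2, F3 = 0.19, 0.46, 0.10
# Assumption: plateau chosen so that f(0) = 0 (not stated in the text).
F0 = F1 + F2 + F3
# Assumption: mantissas from the text, treated as sharing one arbitrary time unit.
TAU1, TAU2, TAU3 = 0.17, 1.24, 12.18


def three_exponential(t, f0=F0, amps=(F1, F2, F3), taus=(TAU1, TAU2, TAU3)):
    """Return f(t) = f0 - sum_i f_i * exp(-t / tau_i)."""
    return f0 - sum(a * math.exp(-t / tau) for a, tau in zip(amps, taus))


def phase_weights(amps=(F1, F2, F3)):
    """Return each phase's share of the total amplitude."""
    total = sum(amps)
    return [a / total for a in amps]


if __name__ == "__main__":
    # Sample the curve at the origin, at each time constant, and at late times.
    for t in (0.0, TAU1, TAU2, TAU3, 10 * TAU3):
        print(f"t = {t:7.2f}  f(t) = {three_exponential(t):.3f}")
    print("phase weights:", [round(w, 3) for w in phase_weights()])
```

Because the three time constants are well separated, each exponential term dominates a different window of the curve, which is what allows the fit to be read as three successive stages.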

Dependence of fibril formation time on number of monomers. 

In order to obtain the dependence of $\tau_{fib}$ on the number of monomers, we fixed the monomer
concentration and computed $\tau_{fib}$ for each system at T^. The fibril formation time scales linearly
with the number of monomers (Fig. 5b), $\tau_{fib} \sim M$, but with different slopes for M < 10 and
M > 10. This is probably related to the difference between protofilament and fibril formation (see
Figs. 2a and 2b). The linear dependence of $\tau_{fib}$ on M supports the template-assisted mechanism
in which monomers are added one by one to preformed ordered structures (protofibrils or fibrils),
provided the number of these monomers exceeds the size of the critical nucleus. Thus, the linear
dependence characterizes growth only during the late stages of ordered assembly. Our results
agree with the experimental findings of Kowalewski and Holtzman [46], who studied aggregation of
Alzheimer's β-amyloid peptides on hydrophilic mica and hydrophobic graphite surfaces, as well
as with the results obtained by Collins et al. [47] for the amyloidogenic yeast prion protein
Sup35.
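
The link between one-by-one addition and a linear dependence on M can be illustrated with a short Python sketch. It assumes that each chain added beyond the nucleus costs a fixed time, with one value in the protofilament regime and another in the fibril regime. All parameter names and values (`nucleus_size`, `crossover`, `t_nucleus`, `t_add_proto`, `t_add_fibril`) are hypothetical and in arbitrary units; in particular, which regime has the steeper slope is chosen arbitrarily here.

```python
def assembly_time(m, nucleus_size=4, crossover=10, t_nucleus=1.0,
                  t_add_proto=0.5, t_add_fibril=1.5):
    """Total time to assemble m chains by adding one monomer at a time.

    Hypothetical model: forming the nucleus costs t_nucleus, and every
    further chain costs t_add_proto up to `crossover` chains and
    t_add_fibril beyond it. All values are illustrative, not fitted.
    """
    if m < nucleus_size:
        raise ValueError("m must be at least the nucleus size")
    total = t_nucleus
    for n in range(nucleus_size + 1, m + 1):
        total += t_add_proto if n <= crossover else t_add_fibril
    return total


if __name__ == "__main__":
    # The increment per added chain is constant within each regime,
    # so the total time is piecewise linear in m.
    for m in range(4, 17, 2):
        print(m, assembly_time(m))
```

In this toy picture the constant cost per added chain is the only ingredient needed to produce a straight line, and a change in that cost at a given size produces the change in slope.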

Interestingly, the dependence of $\tau_{fib}$ on M for such a complicated process as fibril assembly
seems to follow the well-known Lifshitz-Slyozov law. Since $M \sim L^3$, where L is a typical size of
the oligomer, we obtain $\tau_{fib} \sim L^3$, which is the Lifshitz-Slyozov law [48] describing the growth
of a cluster in a supersaturated solution. The finding in Fig. 5b further supports the
"winner-take-all" scenario for oligomer growth because the Lifshitz-Slyozov law is based on the
assumption that the largest cluster grows at the expense of smaller ones.

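The scaling argument for the growth law can be written out step by step. This is a sketch, assuming a compact three-dimensional oligomer so that the number of chains grows as the cube of its linear size L:

```latex
\begin{align}
\tau_{fib} &\sim M, \qquad M \sim L^{3} \\
\Rightarrow \quad \tau_{fib} &\sim L^{3}
\quad \Longleftrightarrow \quad L \sim \tau_{fib}^{1/3}
\end{align}
```

The last relation has the form of the Lifshitz-Slyozov coarsening law, in which the characteristic cluster size grows as the one-third power of time.
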
IV. CONCLUDING REMARKS 

We have used a lattice model to elucidate the generic features of fibril assembly mechanisms
in proteins. Using this toy model, many aspects of the transitions from the monomer to fully
formed fibrils can be monitored. Examination of the kinetics of the assembly process reveals
that several aspects of the complex set of transitions seen in the simple model are also
qualitatively observed in experiments.

1. Ordered fibrils form when the number of chains becomes greater than a critical value. In
our system we find that for M = 16 a stable two-layer fibril is formed, which is perhaps
the minimum replicating unit of the infinite fibril. For smaller M (Fig. 2a) ordered
protofilaments are the lowest energy conformation. It is likely that there are substantial
internal rearrangements of the chains as the number of monomers increases so that
stable fibrils can be populated. Although we did not carry out systematic calculations to
infer the size of the critical nucleus, it appears both from the temperature dependence of
protofilament formation and from the ease of fibril production for M = 16 that the size
of the nucleus has to be less than 10.

2. The kinetics of fibril formation occurs broadly in three distinct stages. In the initial stage,
the chains rapidly partition into clusters of varying sizes. Because of finite size limitations
we are unable to determine the precise distribution of cluster sizes. The chains within each
cluster are mobile and fluid-like. There are, in all likelihood, substantial conformational
fluctuations within each cluster. In the second stage the chains in the clusters form a
number of intra- and inter-chain contacts that lead to disordered oligomers. During
this stage bigger clusters grow at the expense of smaller ones. In the process, protofibrils
form in which many peptides adopt the conformation they eventually have in the fibrils. In
the third, post-nucleation stage the chains add to the largest (single) cluster. In this stage,
which is captured in experiments, the addition of a monomer occurs by a dock-lock mechanism.
Thus, a cascade of events, starting from conformational fluctuations in the monomer that
populate the aggregation-prone conformation (Fig. 1a) and proceeding through a series of
inter-peptide interaction-driven conformational changes, results in fibril assembly.

3. The growth kinetics depends on the depth of quench $\Delta T$, the difference between the
initial temperature at which the chains are brought to equilibrium and the final growth
temperature. When the depth of quench is large, there appears to be a lag time before the
fibrils are populated. In this case the ordered structures form in a highly cooperative manner.
In contrast, when the growth process is initiated by equilibrating the monomers at the final
growth temperature ($\Delta T = 0$), fibril growth occurs in a continuous manner and is less
cooperative (Fig. 6). Because the aggregation-prone structure is unique in the toy model, we do
not observe variations in the morphology of the final fibril structure. This is surely an artifact
of the lattice model.

4. The temperature dependence of $\tau_{fib}$ for M = 10 shows Arrhenius behavior with
$\tau_{fib} \sim \exp(-E_A/k_B T)$ (see inset in Fig. 3). This is in qualitative agreement with
experiments [50]. In addition, the collective rearrangement of several chains from the S to the
N* structure that occurs within the oligomer becomes slower at low temperatures. These two
factors contribute to the barrier that leads to a substantial increase in $\tau_{fib}$ as T is lowered.

5. The mechanism of assembly of fibrils, even in this toy model, is highly complex. While the
overall growth kinetics can be summarized using three-stage growth, the events that
transpire in the distinct stages involve large structural transitions. In the initial "burst
phase", loosely bound clusters form in which the chains are essentially "non-interacting".
In the second stage, stable clusters with considerable inter-particle interactions form. There
is a distribution of oligomers, but owing to the finite size of the simulations the nature of
this distribution is unclear. It is within these oligomers, in which the chains are in a mixture
of S-like and aggregation-prone N*-like states, that the conversion from S to N* takes place.
These transitions result in the formation of ordered oligomers that are large enough to serve as
templates for the conversion of additional monomers or oligomers to form mature fibrils. It is
the last stage that is best described by the dock-lock mechanism.

6. Strikingly, the growth of mature fibrils in the third stage occurs by the Lifshitz-Slyozov
mechanism, in which the largest clusters grow at the expense of smaller ones. The proposed
mechanism supports the physical picture that the S → N* transition occurs either in the
oligomers (NCC model) or upon addition to a preformed ordered template (dock-lock
mechanism). Thus, we find that elements of the three models (NG, TA, and NCC) are found
in each assembly stage. This conclusion is also consistent with a detailed study of fibril growth
in an off-lattice model of poly-alanine [51], in which multiple routes to fibril formation were
found even in the final stages of incorporation of ordered structures or disordered monomers.
Finally, the proposed Lifshitz-Slyozov growth law strongly suggests that seeding with
preformed fibrils should lead to rapid growth because such large structures can incorporate
disordered oligomers on time scales that vary linearly with peptide concentration.

Figure captions



FIGURE 1. (a) Energies and structures of some of the conformations of the monomer obtained by
exact enumeration. Hydrophobic, polar, positively and negatively charged beads are shown in
green, yellow, blue, and red, respectively. There are a total of 1831 possible conformations that
are spread among 18 possible energy values. The non-degenerate native conformation is separated
from degenerate higher energy conformations. The structure enclosed in the box is the one that
the chain adopts in the fibril, and is referred to as N*. The second highest energy structures are
also four-fold degenerate. (b) The probability $P_{N^*}$ of populating the aggregation-prone
structure N* as a function of T. The arrow indicates the temperature at which $P_{N^*}$ is maximum.

FIGURE 2. (a) The lowest energy structure for ten monomers (M = 10). The chains are
arranged in an antiparallel manner. The structure of the monomer is the same as the N*
conformation in Fig. 1a. Beads are colored in the same manner as in Fig. 1a. This single-layer
structure is a protofilament. (b) The double-layer structure for M = 16. As in fibrils of
polypeptides, the "β-sheet-like" monomers are arranged perpendicular to the fibril axis, which
lies parallel to the "salt-bridge" plane (contact between blue and red). Thus, the protofilament
and the fibril are stabilized by hydrophobic interactions and salt bridges.

FIGURE 3. The temperature dependence of the monomeric folding time $\tau_F$ (open circles) and the
time for protofilament assembly $\tau_{fib}$ (squares) for M = 10. Temperature is given in units of
the monomer folding temperature $T_F = 0.5$. This value of $T_F$ is obtained using the condition
$\langle Q(T_F) \rangle = 0.5$, where $\langle Q(T_F) \rangle$ is the fraction of native contacts. The inset
shows $\tau_{fib}$ at low temperatures as a function of 1/T for M = 10.

FIGURE 4. (a) Time dependence of structural quantities probing the formation of fibril
structure. The number of intrapeptide fibril contacts $\langle Q_m(t) \rangle$, the number of interpeptide
contacts $\langle C_{out}(t) \rangle$, the number of fibril contacts $\langle Q_f(t) \rangle$, and the number of fibril
contacts in the largest fibril cluster $\langle Q_{fc}(t) \rangle$ are shown in blue, green, red, and orange,
respectively. The data are averaged over 100 trajectories, and smooth lines represent the
biexponential fits to the data. The fraction of trajectories in which the fibril structure is still
not reached, $P_u(t)$, is shown in black. (b) The time dependence of the number of peptides in the
largest oligomer $\langle N_p(t) \rangle$ and the number of free peptides $\langle N_{free}(t) \rangle$. (c) Same as
in (a) except that the results are for M = 16. (d) Same as (b) but for M = 16.

FIGURE 5. (a) Time dependence of the fraction of fibril structure for M = 16 and T = 0.7.
The dashed curve corresponds to a fit of the simulated data using a sum of three exponentials.
(b) Dependence of $\tau_{fib}$ on M. The change in the slope for M > 10 corresponds to
the transition from protofilament (single layer) to fibrils (double layer).

FIGURE 6. Time dependence of the fibril fraction $f(t)$ for the pentamer (M = 5) at T = 0.4
(< $T_F$). The initial conformations for the high-T quench were generated by equilibrating the
pentamer for 10^ MCS at T = 2.0. Subsequently, assembly of the fibril fragment was initiated by
quenching the temperature to Tg = 0.4. In the low-T quench regime the initial configurations
were generated in the same way, but equilibration was done at T = 0.4. Typical snapshots at
various times during the fibril growth are shown.